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"  A  multivariate  directed  graph  consists  of  a  set  of  g  nodes,  and 
a  family  of  directed  arcs  (one  for  each  relation)  connecting  pairs  of 
nodes.  Such  multivariate  directed  graphs  provide  natural  representa¬ 


tions  for  social  networks.  In  this  paper 


methods  to  analyse 


a  network  of  73  organizations  In  a  Midwest  American  community  linked 

»s  dt1*' 

by  three  types  of  relations:  Information,  money,  and  support.  The 

A 

resulting  data  set,  described  by  Galasklewlcz  and  Marsden  (1978), 

Involves  3  x  73  x  72  «  15,768  possible  arcs  or  "observations*.  Ate.  j> 

describes  a  class  of  stochastic  logl inear  models  for  multivariate  directed 
graphs,  demonstrate  how  they  can  be  fit  to  the  data  using  generalized 
Iterative  scaling  of  Darroch  and  Ratcliff  (1972),  and  explain  the  connection 
between  these  models  and  variants  on  standard  logllnear  models  for  multi¬ 
dimensional  contingency  tables  discussed  by  Bishop,  Flenberg,  and 

tT 

Holland  (1975).  Vie- also  considered  disaggregation  of  the  organizations 
Into  sub-groups,  and  demonstrate  how  to  adapt 4MHB»mode Is  to  explore 
the  Intra-  and  Inter-group  relationships.  These  methods  generalize 
research  of  Holland  and  Lelnhardt  (1980),  who  develop  a  model  for 
dyadic  relationships  in  univariate  directed  graph  data.  The  paper 
Includes  a  detailed  analysis  of  the  Galasklewlcz-Marsden  data.  - 


T  " —  ' 
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1.  Introduction 

Although  this  conference  Is  entitled  "Looking  at  Multivariate  Data"* 
most  attendees  and  authors  have  Interpreted  this  to  mean  looking  at  multi¬ 
variate  data  "using  graphical  methods".  Despite  the  fact  that  the  title 
of  the  present  paper  contains  the  words  "data"*  "multivariate",  and 
"graphs”,  we  shall  break  step  with  these  other  authors  and  describe  a 
class  of  multivariate  network  problems.  We  would  have  liked  to  address 
these  problems  using  graphical  methods  but, for  the  moment, we  have  been 
forced  to  settle  for  a  more  traditional  multivariate  model-based  approach. 
This  may  seem  even  more  surprising  since  the  network  problems  we  address 
begin  with  data  that  correspond  to  a  picture  or  graph. 

—  Figure  1  goes  about  here  — 

Figure  1  contains  an  example  of  a  univariate  directed  graph,  a  graphi¬ 
cal  representation  of  a  network  involving  g  *  6  Individuals.  There  are 
g(g-U  a  30  possible  arrows  or  directed  arcs  llnklnq  these  6  Individuals 
in  pairs,  only  12  of  which  are  present  In  Figure  1.  The  Information  In  a 
univariate  graph  for  g  Individuals  can  be  summarized  by  means  of  a  g  x  g 
adjacency  matrix  x,  with  elements 


1  If  1  relates  to  j 
0  otherwise. 


where,  by  convention,  the  diagonal  terms,  x^,  are  set  equal  to  zero. 
The  adjacency  matrix  for  Figure  1  Is: 
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1  2  3  4  5  6 


I 

] 


0  10  110 
10  10  10 
0  0  0  0  1  1 
0  1  0  0  0  0 
0  10  10  1 
0  0  0  0  0  0 


There  are  several  approaches  that  we  might  adopt  to  model  the 

data  In  the  adjacency  matrix*  x.  For  example,  we  might  focus  on  the  6 

Individuals  and  assume  that  Individual  1  makes  5  possible  Independent 

choices  (corresponding  to  arrows),  with  some  unknown  Bernoulli  parameter, 

p,  (1  ■  1,2,..., 6).  Then  a  suitable  data  summary  would  be  the  row  totals 
*  • 

of  x,  i.e.,,  (3, 3,2,1 ,3,0).  The  assumption  of  Independence  of  choices 

mt 

Is  not  likely  to  be  satisfied  In  practice,  however.  Alternatively,  we 

might  focus  on  the  6  x  5  «  30  pairs  of  Individuals,  and  assume  that  the 
# 

data  for  the  pairs  are  Independent  and  Identically  distributed.  In 
effect,  then,  we  would  choose  to  focus  on  relationships,  and  would  observe 
three  different  types: 


© 


© 


©5= 


X 

a® 


Null, 

Asymmetric, 
Mutual . 


Thus  the  observed  data  would  be  sunmarlzed  In  the  following  2x2  table: 


Send  Choice 


Receive  Choice 


Yes 

No 

Yes 

4 

8 

12 

No 

8 

10 

18 

12 

18 

30 

Note  that  each  pair  has  been  counted  twice,  once  for  "sending"  and  once 
for  "receiving",  thus  merging  the  asymmetric  relationships. 

The  approach  Involving  pairs  essentially  uses  the  g(g-l)  permutations 
of  the  g  Individuals,  two  at  a  time,  and  thus  leads  to  a  doublecounting 
of  each  pair.  By  focussing  on  the  |||  *  g(g-l)/2  combinations  or  dyads, 

we  can  eliminate  the  doublecounting  and  obtain  the  following  sunmary: 


© 

© 


© 

<D 


Q<  =*© 


No.  of  Dyads 
5 
8 
2 


In  this  paper,  we  consider  stochastic  models  of  multivariate  directed 
graphs.  Involving  several  types  of  arrows  or  relationships,  that  treat  the 
^||  dyads  as  Independent  random  variables.  We  do  this  in  the  full  know¬ 
ledge  that  for  most  network  problems  dyads  are  constructs.  We  do  not 
sample  them.  Rather,  If  we  sample  at  all,  we  take  a  sample  of  Individuals 
and  we  measure  Information  on  dyadic  relationships.  The  Independence  of 
dyadic  Information  Is  an  assumption  which  In  practice  Is  In  need  of  some 
verification.  We  do  not  address  this  Issue  In  this  paper.  For  population 


directed  graph  data,  consisting  of  the  dyad  Information  for  all  of  the 
Individuals  In  a  network,  the  use  of  stochastic  models  leans  for  support 
on  (.&)  randomization  arguments,  (b)  superpopulation  Ideas,  or  (c)  It 
simply  provides  a  convenient  framework  for  exploratory  data  analysis. 

In  the  next  section  we  describe  a  set  of  network  data  Involving 
organizations  and  three  types  of  organizational  relations.  Then,  In 
Section  3,  we  describe  a  class  of  models  and  multivariate  methods  for  the 
analysis  of  such  data,  which  treats  the  organizations  as  a  single  group. 

After  fitting  these  models  to  the  data  In  Section  4,  we  further  develop 
the  models  In  Section  5  to  allow  for  disaggregation  of  the  organizations 
Into  subgroups.  We  conclude  by  returning  to  the  graphical  theme  of  this 
conference,  and  suggest  some  extensions  of  our  modelling  approach  which 
might  lead  to  Interesting  graphical  summaries. 

2.  A  Specific  Network:  Towertown,  U.S.A. 

The  data  that  have  motivated  our  work  on  this  topic  come  from  a 
study  of  109  formal  organizations  0*  1th  more  than  20  employees!  In  a 
small  midwest  United  States  conrnunlty  of  32,000  persons,  referred  to  by 
the  pseudonym  "Towertown".  Galasklewlcz  (1979)  described  the  survey  of 
Towertown,  Galasklewlcz  and  Marsden  (1978)  report  on  the  data  considered 
here,  and  we  have  described  the  data  elsewhere  In  detail  (Fienberg  and 
Wasserman,  1981).  For  the  present.  It  will  suffice  to  note  that  we  are 
concerned  with  the  results  of  questionnaire  data  for  a  subset  of  73 
organizations,  representing  the  ties  between  pairs  of  organizations  for 
three  types  of  relations:  Cl)  Information,  (11)  money,  and  (111)  support. 
This  data  set  can  then  be  represented  as  a  multivariate  directed  graph, 
summarized  by  three  adjacency  matrices  defined  for  the  same  73  organizations. 
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(>1»  X£.  Xj).  Each  matrix  Is  of  dimension  73  x  73  and  represents  73(73-1) 

■  5256  possible  directed  arcs  using  0's  and  1’s.*  Given  the  size  of 
these  matrices,  It  should  not  be  surprising  that  graphical  representations 
of  even  the  univariate  links  are  too  complex  to  comprehend. 

Thus,  we  still  need  a  way  to  look  at  and,  perhaps  more  importantly, 
summarize  the  data.  Table  1  contains  one  such  summary  of  the  data  given 
by  Galasklewlcz  and  Marsden  (1978),  In  the  form  of  a  26  table  of  counts  of 
pairs  of  organizations.  This  table  gives  the  direct  multivariate  generalization 
of  the  2  x  2  representation  for  a  single  relation  given  In  Section  1.  Each  pair 
of  organizations  is  counted  twice,  once  from  the  perspective  of  each 
member.  Thus .the  total  of  the  counts  In  the  table  Is  5256,  twice  the 
number  of  pairs,  g3]  *  2628*  Henceforth, we  refer  to  Table  1  as  the 
w-table  with  entries  {w^, jj'kk'** 

—  Table  1  goes  about  here  ~ 

The  26  cells  of  Table  1  consist  of  (a)  8  cells  whose  counts  are 
doubled,  and  (b)  28  cells  whose  counts  are  duplicated.  If  we  eliminate 
the  ‘duplication  and  doubling  of  counts.we  get  an  arrangement  of  36  cells, 
whose  counts  correctly  total  2628.  In  Table  2  we  give  one 
possible  representation  of  these  36  cells  in  a  form  resembling  a  three- 
dimensional  4x4x4  cross-classification,  where  the  three  "variables" 
correspond  to  the  three  relations  (1)  Information,  (2)  money,  and  (3)  support. 

—  Table  2  goes  about  here  — 

^Throughout  this  paper  we  work  with  summaries  of  this  data  set . 

The  full  data  set,  consisting  of  three  adjacency  matrices  and  pseudonyms 
for  the  organizations.  Is  available  on  request  from  the  authors. 
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When  the  dyadic  structure  for  a  single  relation  Is  asymmetric,  the 
"direction"  of  the  corresponding  arc  does  not  matter.  We  use  a  single 
subscript.  A,  to  denote  the  relation  in  such  situations.  When 
the  dyadic  links  for  two  or  more  relations  are  both  asymmetric,  we  need  to 
dfstlnguish  between  situations  where  the  arcs  for  a  pair  of  relations 
go  In  the  same  or  different  directions.  Thus,  for  these  situations,  we 
use  two  different  subscripts,  A  and  A,  with  Identical  subscripts  for 
those  relations  whose  asymmetric  directed  arcs  go  In  the  same  direction. 
We  arbitrarily  assign  the  subscript  A  to  the  lowest  numbered  asymmetric 
generator.  (Note  that  Interchanging  the  subscripts  A  and  A  yields  the 
same  dyadic  structural  relationship.)  We  denote  the  observed  counts  In 
Table  2  by  zabc»  for  a,  b,  c  3  M,  A,  A,  N  (for  Mutual,  Asymmetric, 
Symmetric,  and  Null),  where  the  convention  for  the  use  of  the  sub¬ 
scripts  A  and  A  is  as  described  above.  These  observed  counts  can  be 
thought  of  as  realizations  of  a  set  of  random  variables,  {Zabc> •  whose 
probability  structure  we  wish  to  model. 


3.  Log! Inear  Models  for  Multivariate  Directed  Graphs 

We  wish,  to  model  the  probability  pabc  that  a  randomly  selected  dyad 
would  be  assigned  to  cell  (a,b,c)  In  Table  1,  where 


C3.1) 


E  P»bc  " 
all  cells  aDC 


Although  we  might  think  of  using  logl inear  models  directly  for  the  Cp#5C)» 
such  an  approach  leads  to  difficulties  of  Interpretation  (see  Flenberg 
and  Wasserman,  1980, for  further  details).  Instead,  we  define 
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(.3.2) 


If  a,  b,  and  c  are  each  equal  to  M  or  N, 
tf  one  of  a,  b,  and  c  equals  A. 


Our  plan  Is  to  develop  a  class  of  linear  models  for  the  {£abch  which 
for  (Pabc)  yields  an  affine  translation  of  a  class  of  logllnear  models 
(see  Chapter  9  of  both  Haberman,  1974  and  Haberman,  1979).  This  approach 

(a)  treats  dyads  Involving  asymmetric  ties  as  having  been  produced 
with  an  orientation  and  then  pooled.  (This  also  accounts  for 
the  divisor  of  2  for  counts  Involving  asymmetric  ties.) 

(b)  Includes  as  a  special  case  the  model  of  Independent  Individual 
choices  (see  the  discussion  of  Section  1). 

Cc)  Is  directly  related  to  an  approach  of  Holland  and  Lelnhardt 
(1980)  which  allows  for  parameters  associated  with  the  Indivi¬ 
duals  in  the  dyad  (see  also  Flenberg  and  Wasserman,  1981). 

We  plan  to  consider  models  for  the  Uabc)  which  are  linear  In  parameters 
tha^  reflect  the  13  distinct  types  of  dyadic  patterns  depicted  in  Figure 
2.  Note  that  the  patterns  have  a  hierarchical  structure.  For  example, 
the  six-arrow  full  symmetry  pattern,  (xill),  contains  all  the  other 
patterns  as  special  cases,  and  the  conditional  multiplex  mutuality  pattern, 
(xll),  contains  patterns  (1)  through  (xi)  as  special  cases.  We  consider 
a  class  of  Increasingly  complex  logllnear  models  for  the  (Cabc}  with 
parameters  based  on  the  patterns  In  Figure  2. 

—  Figure  2  goes  about  here  — 

(I)  The  null  model  corresponding  to  Figure  2(1)  depicts  the 
probabilities  (Pabc)  as  being  constant,  and  could  be  represented  as 

*abc  * 
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where  8  *  log(l/36).  This  Is  an  Individual,  Independent  Bernoulli  choice 
model.  For  subsequent  models  we  use  8  as  a  normalizing  constant. 

err)  At  the  next  level,  we  add  choice  parameters  ,{0-|,82,83}  for  the 

relations  (Figure  2(H)),  one  for  each  directed  arc.  For  example, 

5MAH  ■  8  *  281  *  e2 

«MAA  ■  6  +  291  *  h  +  «3 

-  8  ♦  28,  +  e2  +  e3. 

(Ill)  Next, we  add  sets  of  parameters  corresponding  to  heightened 
or  diminished  effects  related  to  pairs  of  directed  arcs: 

(a)  p-|-| ,  p12#  Pj3  for  mutuality  effects  (see  Figure  2(111)), 

(b)  P12»  P-J3*  p23  for  exchange  effects  (see  Figure  2(1  v)), 

(c)  8^2,  e13,  823  for  multi pi exity  effects  (see  Figure  2(v)), 

For  Example: 

^AA  *  8  +  28,  ♦  82  +  83  ♦  Pn  ♦  P12  ♦  P13  +  P23  ♦  e12  ♦  0l3» 

*  0  +  2e-j  +  02  +  203  +  Pn  +  P33  +  P-|2  +  2P-|3  +  P23 

+  e12  ♦  2813  +  e23. 

There  are  additional  sets  of  parameters  corresponding  to  the  remaining  4 
levels  In  Figure  2.  At  level  IV,  one  of  these  parameters  Involves  only  multi- 
plexlty  and  thus  Is  denoted  by  a  triple  subscripted  0,  l.e. ,  8123.  The 
remaining  parameters  Involve  mixtures  of  mutuality,  exchange,  and  mul ti¬ 
pi  exity,  and  are  denoted  by  subscripted  (p8)’s.  Overbars  on  subscripts  are 
used  to  distinguish  asymmetric  directed  arcs  going  In  opposite  directions. 


e.g. ,  (pe).,^. 

The  parameters  tn  this  class  of  models  are  GLIM-1 Ike  In  structure 
(e.g. ,  see  Nelder  and  Wedderbum,  1972),  In  that  a  parameter  Is  Included 
In  the  model  if  and  only  if  the  corresponding  effect  Is  present.  The 
entries  of  the  resulting  “design  matrix"  for  the  parameter  structure  for 
any  given  model  will  be  0's,  Vs,  and  2's.  This  particular  problem  could 
be  handled  In  GLIM  directly  only  through  the  explicit  construction  of 
this  design  matrix,  which  Is  a  formidable  task. 

The  parameters  have  a  hierarchical  structure,  l.e. ,  If  we  set  some 
parameters  equal  to  zero,  all  related  higher-order  terms  are  also  zero. 
For  example, 

912  "  0  **  0123  "  ^p0hl2  *  ^p0^221  *  ^p0^3l? 

"  *p0hl23  *  *p0,nZ3  *  (p0)2213 
"  tp0)22TJ  “  (p0hl22  "  (p0)11223 
.  *  (p0) 11332  *  {p0)22331 

"  (p0hl2233  ’  °* 

and 

pn  “  0  "*  ^p0)112  “  (p0)ii3  ■  (p®)ii23  *  (pehl2T 

Cpe)n22  *  (p0)  1^33  *  (P0)n223 

“  ^p0  hi  332  "  (p0hl2233  *  0> 

In  the  next  section  we  discuss  how  to  fit  these  models  to 


social  network  data. 
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4.  Fitting  the  Models  to  Data 

Fitting  the  log! inear  models  of  the  preceding  section  to 
data  in  Table  2  follows,  in  principle,  directly  from  the  general  results 
for  loglinear  models  In  Haberman  (1974)  or  Appendix  II  of  Flenberg  (1980). 
The  minimal  sufficient  statistics  (MSS's)  take  the  form  of  linear  combina¬ 
tions  of  the  {zabc> , 


(4.1) 


z 

all  cells 


aabc  zabc* 


where  for  a  MSS  corresponding  to  "generic"  parameter,  B, 


(4.2)  aabc  *  mult1Ple  of  B  in  5abc* 

The  multiples  of  all  parameters  are  either  0,  1,  or  2,  and  thus  all  of  the 
a's  are  either  0,  1,  or  2. 

If  we  let  the  expected  value  for  the  (a,b,c)  cell  be  mabc  ■  N*pabc 
where  N  =  (ij  *  then  the  Hkel^0°d  equations  are  found  by  setting  the 
MSS's  equal  to  their  estimated  expected  values,  l.e.,  for  a  generic 
parameter  the  likelihood  equation  Is: 


(4*^)  ..  ^  ..  aabc  mabc  *  ..  ^  aabc  zabc* 
all  cells  "  w  a  all  cells 

Vte  can  solve  a  set  of  likelihood  equations,  each  of  the  form  (4.3),  by 
using  a  version  of  the  generalized  Iterative  scaling  algorithm  due  to 
Oarroch  and  Ratcliff  (1972),  with  starting  values  as  fellows: 


(4.41 


if  a,  b,  and  c  are  each  equal  to  M  or  N 
if  one  or  more  of  a,  b,  and  c  equals  A. 


There  are  two  drawbacks  to  this  approach.  First,  one  needs  to  work  with 
data  arrays  of  the  irregular  shape  of  Table  2.  Second,  the  convergence 
of  generalized  iterative  scaling  can  be  excruciatingly  slow. 

All,  however.  Is  not  lost.  Two  results,  one  simple  and  one  relatively 
complex,  lead  us  to  a  very  straightforward  alternative  approach  for 

A 

computing  the 

Result  1:  For  the  class  of  affine  translations  of  hierarchical 
logl Inear  models  described  In  Section  3,  each  set  of  MSS's  Is 
equivalent  to  a  set  of  marginal  totals  for  the  2®  table  (l.e., 
the  w-table)  with  doubled  and  duplicated  counts. 

For  example,  the  simple  model  with  only  a  choice  parameter,  8^,  and  a 
mutuality  parameter,  p,-j,  for  the  first  relation  has  MSS's  {zMl , .  z^, , . 

W’and 

(4.5)  ZM++  "  **  wn++++* 

*  ZA++  *  *10++++  "  w01++++* 

ZN++  *  **  *00++++’ 

Result  2:  For  each  affine  translation  of  a  loglinear  model 
for  the  z-table,  there  Is  a  corresponding  loglinear  model 
for  the  w-table,  with  equivalent  estimated  expected  values, 
once  we  take  account  of  the  duplication  and  doubling. 

For  example,  for  the  model  with  choice  and  mutuality  parameters,  l.e.. 


(4.6) 


{8»8| »®2’®3,p11 ,p22*p33^ * 
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tne  corresponding  logltnear  model  for  the  w-table  that  yields 
equivalent  MLE’s  Is,  in  GLIM-1 Ike  notation: 

(4.7)  log  *  I  ^  ^1^1  ^  • 

+  Xgfij  +  X2,6j. 

+  X3dk  ♦  X3,5k, 

+  X11<16{,  + 

+  X336k6k‘  * 

Here  is  the  expected  value  for  the  (1.1* ,J,J* ,k,k')  cell, 

and  each  6-term  equals  1  If  the  subscript  takes  the  value  1,  and  Is  zero 
otherwise. 

To  understand  Result  2  we  need  to  note  the  following  correspondences 

between  the  w-table  and  the  z-table: 

**»  «■» 

w-table  z-table 

«** 

'Cell:  CM' JJ’.k.k')  (a.b.c) 

Symmetric  flows:  1  •  1',  j  ■  j'.  k  ■  k'  a.b.c  ■  M  or  N 

Because  of  the  doubling  of  the  counts  In  Table  1,  we  have: 

II og  (2  maJ)C)  for  synmetrlc  flows, 

log  (m#bc)  for  asynmetrlc  flows. 
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Substituting  expression  (3.2)  into  (4.8)  and  noting  that  m#bc  -  |||  P#bc* 
we  get 

(4.9)  log  •  [2  j|jj  ♦ 

Thus  the  models  for  log  ">11  •  j j * kk*  an<*  ^abc  by  only  *  constant. 

A  direct  consequence  of  these  two  results  is  that  we  can  compute 
MLE's  for  the  expected  values  under  the  models  of  Section  3  using  standard 
Iterative  methods  for  contingency  tables.  (This  Is  In  fact  what  Galasklewlcz 
and  Marsden  (1978)  did  In  their  original  analyses  of  Table  11).  For 
example,  for  the  model  with  parameters  given  by  (4.6),  the  MSS's  are 
equivalently  given  by  the  two-way  marginal  totals  of  the  w-table: 

^*1 1 '  ++++*  •  **++j  j  *  ++*  *  Wf+Hck '  * 

These  marginals  can  be  fit  to  the  2®  table  using  the  standard  Iterative 
proportional  fitting  procedure  (or  some  other  program  such  as  GLIM). 

Because  of  symmetries  In  marginal  totals,  e.g. , 


*10++++  *  *01++++’ 

*++10++  *  *++01++’ 

*++++10  *  *++++01* 

the  resulting  parameter  estimates  are  such  that 

A  A  A  A  A  A 

^1  *  I  »  Xa  *  ^91  •  • 


# 


The  estimated  parameters  for  the  models  for  £abc  can  be  computed  directly 
from  these  parameters: 


A  A 

6  *  X  -  log 

['  Is  1] 

A  A 

6,  •  X, 

1  •  1,2,3 

A  A 

*11  *  X11 

1  -  1,2,3 

Me  note  that  the  d.f.  for  any  model  must  be  calculated  using  the  model 
for  the  z-table,  not  the  one  for  the  w- table,  and  the  value  of  any  standard 
goodness-of-flt  statistic  computed  directly  on  the  fitted  w-table  must 
be  divided  by  2. 

5.  Initial  Analyses  of  the  Towertown  Data 

In  Table  3  we  list  a  set  of  seven  logllnear  models  that  we  have  fit 
to  the  Galasklewlcz-Marsden  data  of  Table  1  tsome  of  these  models  correspond 
to  ones  fit  by  Galasklewlcz  and  Marsden).  The  first  six  models  are  of 
Increasing  complexity,  and  only  the  most  complex  of  these  models,  (6),  pro¬ 
vides  a  fit  which  it  not  significant  at  the  0.05  or  even  0.01  level. 

Model  (7)  Is  a  compromise  between  models  (5)  and  (6)  that  drops  one  of  the 
conditional  mutuality  and  two  of  the  multiplex  mutuality  effects  but  still 
provides  an  acceptable  fit  to  the  data.  Its  parameter  estimates  are  listed 
In  Table  4. 

—  Tables  3  and  4  go  about  here  ~ 

The  most  substantial  estimated  effects  (In  terms  of  magnitude)  are 

A  A 

those  associated  with  choice  (0^'s),  mutuality  (p^'s),  conditional 
mutuality  (p0)33j  ■  -2.15  and  multiplex  mutuality  (pe)^33  »  2.88.  Inter¬ 
preting  these  effects  Is  complicated.  For  all  hierarchical  models,  with 
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nonorthogonal  designs,  the  parameters  that  are  easiest  to  Interpret  are 
those  associated  with  the  hlghest-order  effects.  Here  the  multiplex  mutuality 
parmaeter  estimate  Implies  a  heightened  likelihood  of  simultaneous  recipro¬ 
cation  of  both  Information  and  support,  relative  to  what  we  would  expect  In 
a  model  without  the  multiplex  mutuality  parameter. 

One  of  the  major  difficulties  with  the  models  of  Section  3  Is  that 
dyads  are  considered  to  be  homogeneous  and  thus  do  not  allow  for  the  Inherent 
differences  among  the  organizations.  Without  some  allowance  for  this  hetero¬ 
geneity,  further  Interpretation  of  fitted  models  makes  little  sense.  In 
Table  5  we  list  pseudonyms  for  each  of  the  73  organizations,  and  provide  a 
partition  of  them  Into  four  sub-groups: 

1.  Business  g-j  »  16  , 

2.  Political  g2  -  24  , 

3.  Nonprofit  voluntary  associations  g3  ■  21  , 

4.  Nonprofit  service  associations  g^  ■  12  . 

We  postulate  that  the  sociological  factors  affecting  Interaction  should  be 

% 

relatively  homogeneous  within  these  groups.  Thus,  we  can  categorize  the 
original  |||  *  j^j  *  2628  dyads  Into  the  cells  of  an  upper  triangular 
4x4  array: 


No.  of  Dyads: 


S,  Gj 

S3 

s« 

120  384 

336 

192 

276 

504 

288 

210 

252 

66 

For  each  cell  In  this  array  there  Is  a  2°  table. 
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—  Table  S  goes  about  here  — 

Within  each  of  the  four  groups  we  can  analyze  flows  using  a  26  table 

and  the  models  from  Section  3.  These  2®  tables  have  the  same  doublings 

and  duplications  as  the  aggregated  26  table.  The  flows  between  groups 

(In  pairs)  now  have  an  orientation  and  there  are  corresponding  26  tables 

describing  these  flows  which  contain  no  doubling  and  no  duplication.  We 

can  analyze  each  of  these  tables  with  standard  logl Inear  models  that 

parallel  those  models  for  within  group  flows.  The  total  number  of  cells 

In  the  full  table  is  (4  *  36)  ♦  (6  x  64)  -  528. 

In  Table  6,  we  report  the  result  of  fitting  separate  multiplex  mutuality 

models  (model  (6)  of  Table  3)  to  each  of  the  10  26  arrays.  While  this  model 

2 

fits  extremely  well  (G  is  less  than  the  d.f. ),  this  Is  In  large  part  the 
result  of  fitting  352  parameters.  An  alternative  modelling  approach  links 
the  within  and  between  group  models.  For  example,  we  might  take  a  common 
"interaction  structure"  for  all  10  2®  tables,  but  allow  only  the  choice 
parameters  (the  e^'s)  to  depend  on  groups.  The  result  Is  model  (2)  In  Table 

5,  whose  fit  Is  not  horrid  but  Is  still  significant  at  the  0.005  level.  A 

compromise  between  models  (1)  and  (2)  of  Table  3  would  have  a  comnon  model 
« 

for  wlthln-group  flows  and  a  separate  variant  on  model  (2)  for  between- 
group  flows.  We  report  the  fit  of  two  such  models  In  Table  6.  Model  (3b) 
fits  extremely  well,  and  provides  a  convenient  starting  point  for  further 
analyses  of  the  data. 

—  Table  6  goes  about  here  — 

6.  A  Possible  Graphical  Display  for  Multivariate  Directed  Graphs 

The  second  set  of  analyses  of  the  preceding  section  leads  quite 
naturally  to  analyses  Involving  a  further  dl saggregatfon  of  organizations. 
Indeed  we  could  carry  the  disaggregation  to  the  limit,  with  each  organize- 


tion  forming  Its  own  group  of  one.  We  could  postulate  models  with 
different  choice  parameters  for  each  organization  and  a  comon  higher* 
order  parametric  structure.  Actually,  we  would  end  up  with  Individual 
sending  and  receiving  parameters  for  each  organization  and  each  relation. 

The  resulting  model  is  in  the  same  spirit  as  the  bivariate  models  sug¬ 
gested  by  Holland  and  Lelnhardt  (1980). 

The  attractive  feature  of  this  fully-disaggregated  approach  Is  that 
we  can  examine  the  estimated  higher  order  structure  In  a  tabular  form 
similar  to  that  of  Table  4,  and  look  separately  at  the  estimated  Individual 
parameters.  The  latter  can  be  displayed  In  a  set  of  three  overlayed 
"correspondence-like"  plots  of  the  73  organizations.  The  sending  and 
receiving  parameter  estimates  for  an  organization  could  be  used  as  the 
abscissa  and  ordinate  for  a  corresponding  point,  and  the  three  points  for 
different  relations  could  be  linked  to  form  a  triangle.  This  plot  should 
show  not  only  the  clustering  of  organizations  but  also  the  similarities 
of  their  behavior  with  regard  to  the  three  different  relations  being 

considered.  We  have  stopped  short  of  producing  the  plot  for  the  Towertown 

% 

data  for  computational  reasons.  The  Iterative  methods  used  here,  and  In 
Flenberg  and  Wasserman  (1981)  for  the  univariate  version  of  the  disaggre¬ 
gated  model ,  when  applied  to  the  Towertown  data  simply  take  up  too  much 
computing  storage.  We  hope,  however,  that  alternative  computational  methods 
currently  under  development  might  make  possible  some  graphical  displays 
for  multivariate  directed  graphs  In  the  not-too-dlstant  future. 
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FIGURE  2.  PATTERNS  OF  FLOW  DEPENDENCY  IN  DYADIC  PATTERNS 
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T*U  1.  OBSERVED  DISTRIBUTE*  Of  HIDMMII0IHM.  TRANSACTIONS  INVOLVING  THREE  REUTIONS  AND  73  ORGANIZATIONS 
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g2  is  the  log-likelihood  ratio  chi-square  goodness -of-f it  statistics. 


TABLE  4.  ‘PARAMETER  ESTIMATES  FOR  MODEL  (7)  FITTED  TO  THE  DATA  FROM 
TABLE  1 


Parameter 


Estimate 


TABLE  5.  PARTITION  OF  73  ORGANIZATIONS  INTO  4  GROUPS 


TABLE  6.  MODELS  FIT  TO  THE  10  26  TABLES  FORMED  BY  THE  PARTITION  OF  THE 
73  ORGANIZATIONS  INTO  THE  4  GROUPS  GIVEN  IN  TABLE  8  (528  CELLS) 


Model 

G2 

D.F. 

(1) 

Separate  models  for  each  2 6  table,  each 
based  on  all  multiplex  mutuality  and 

Implied  lower-order  terms 

136.0 

176 

(2) 

A  cotnnon  Interaction  structure  for  all  26 
tables,  based  on  all  multiplex  mutuality 
and  Implied  lower-order  terms,  but  one- 
factor  choice  parameters  (6^ is)'3epend1ng 
on  the  groups  1 

629.0 

482 

(3a) 

A  common  multiplex  mutuality  model  for 
within  group  flows  plus  a  between  group 
model  similar  to  model  2 

409.0 

352 

(3b) 

Model  (3a)  plus  a  set  of  "information" 
multiplex  parameters  (d^)  for  between 

groups  that  depend  on  the  groups 

355.7 

343 

ECURITV  CLASSIFICATION  OF  THIS  PACE  fWhmn  D«r 


REPORT  DOCUMENTATION  PAGE 


Stephen  E 
Michael  M. '/Meyer 
Stanley  S . / Wasserman 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


J./recipient's  catalog  number 

Hr 


5.  TYPE  OF  REPORT  ft  PERIOO  COVERED 

TR,  to  July  1980 


ft.  performing  org.  report  number 

TP  #185 


10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  ft  WORK  UNIT  NUMBERS 


II.  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

Contracts  Office 
Carnegie-Mellon  University 


(Li 


i  ii u  iiiayii 


u.  MONITORING  AGENCY  NAME  a  AOORESS^//  dlltoroni  from  Control  ling  OUfco)  I  15.  SECURITY  CUASS.  (of  thlo  report; 


1  3^ 


Unclassified 


CCCL  ASSl  PI  CATION/  DOWNGRADING 
SCHEOULE 


»6-  DISTRIBUTION  STATEMENT  (of  thlo  Report) 


APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  UNLIMITED. 


17.  DISTRIBUTION  STATEMENT  (of  tho  oboiroct  entered  In  Slock  20,  ti  dllforont  from  Roport) 


19.  KEY  WOROS  (Contlnuo  on  reverse  »ldo  It  neceeeery  end  Idontlty  by  block  number) 


multivariate  directed  graph,  social  networks,  stochastic 
loglinear  models 


20.  A*?si  R ACT  r Contlnuo  on  reverse  tido  H  neceeeery  end  Idontlty  bv  block  number; 


A  multivariate  directed  graph  consists  of  a  set  of  g  nodes, 
and  a  family  of  directed  arcs  (one  for  each  relation)  connectin 
pairs  of  nodes.  Such  multivariate  directed  graphs  provide 
natural  representations  for  social  networks.  In  this  paper 
we  consider  methods  to  analyse  a  network  of  73  organizations 
in  a  Midwest  American  community  linked  by  three  types  of 


DO  ,  1473  EDITION  OF  I  NOV  AS  15  OBSOLETE 


'  'i  : -cl-  .=• :: j-  vsoi 


SECURITY  CLASSIFICATION  OF  TniS  PACE 


3*1190 


security  classification  of  tvms  pace  cn>mi  c.  , 


relations:  information,  money,  and  support.  The  resulting 
data  set,  described  by  Galaskiewicz  and  Marsden  (1978), 
involves  3  x  73  x  72  =  15,768  possible  arcs  or  "observations". 
We  describe  a  class  of  stochastic  loglinear  models  for  multi¬ 
variate  directed  graphs,  demonstrate  how  they  can  be  fit  to 
the  data  using  generalized  iterative  scaling  of  Darroch  and 
Ratcliff  (1972) ,  and  explain  the  connection  between  these 
models  and  variants  on  standard  loglinear  models  for  multi¬ 
dimensional  contingency  tables  discussed  by  Bishop,  Fienberg, 
and  Holland  (1975) .  We  also  consider  a  disaggregation  of  the 
organizations  into  sub-groups,  and  demonstrate  how  to  adapt 
our  models  to  explore  the  intra-  and  inter-group  relationships 
These  methods  generalize  research  of  Holland  and  Leinhardt 
(1980) ,  who  develop  a  model  for  dyadic  relationships  in  uni¬ 
variate  directed  graph  data.  The  paper  includes  a  detailed 
analysis  of  the  Galaskiewicz-Marsden  data. 


